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Abstract 

During last years poker has gained a lot of prestige in several countries and, beyond to be one of 
the most famous card games, it represents a modern challenge for scientists belonging to different 
communities, spanning from artificial intelligence to physics and from psychology to mathematics. 
Unlike games like chess, the task of classifying the nature of poker (i.e., as ’skill game’ or gambling) 
seems really hard and it also constitutes a current problem, whose solution has several implications. 
In general, gambling offers equal winning probabilities both to rational players (i.e., those that use 
a strategy) and to irrational ones (i.e., those without a strategy). Therefore, in order to uncover the 
nature of poker, a viable way is comparing performances of rational versus irrational players during 
a series of challenges. Recently, a work on this topic revealed that rationality is a fundamental 
ingredient to succeed in poker tournaments. In this study we analyze a simple model of poker 
challenges by a statistical physics approach, with the aim to uncover the nature of this game. As 
main result we found that, under particular conditions, few irrational players can turn poker into 
gambling. Therefore, although rationality is a key ingredient to succeed in poker, also the format 
of challenges has an important role in these dynamics, as it can strongly influence the underlying 
nature of the game. The importance of our results lies on related implications, as for instance in 
identifying the limits poker can be considered as a ‘skill game’ and, as a consequence, which kind 
of format must be chosen to devise algorithms able to face humans. 
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Nowadays, social dynamics and modeling hnman behavior represent challenging topics for 
scientists belonging to different commnnities, e.g., artihcial intelligence, physics, mathemat¬ 
ics and social psychology. Notably, the modern held of sociophysics [T] aims to investigate 
social and economic phenomena by a strongly interdisciplinary approach, mainly based on 
analytical and compntational tools, coming from the framework of statistical physics [H]. 
Moreover, several social issnes as opinion formation, information spreading and social be¬ 
haviors, can be represented and stndied by nsing agent-based models mu often combined 
with the theory of networks [THin]. In this work, we analyze poker games (hereinafter 
simply poker) by the framework of statistical physics (see also [H]). Poker represents one 
of the major challenges for artihcial intelligence and mathematics [IMS] ;for instance, it is 
worth to highlight that, in the recent stndy ca, the ‘heads np’ limit poker (later described) 
has been solved from a game theory perspective. Fnrthermore, poker is a topic of interest 
also for psychologists, economists and sociologists [16] dne to its wide dihnsion over several 
conntries. One of the most controversial aspects of poker, cansed by the ntilization of money, 
is related to its natnre, i.e., ‘skill game’ or gambling. The related answer has not yet been 
solved na, althongh it has a long list of implications [T8l [T9] . Furthermore, all ehorts made 
to dehne algorithms and strategies in the context of artihcial intelligence are obviously based 
on the conhdent belief that computing skills are relevant to succeed in poker. Therefore, 
our investigations aim to shed some light on the nature of this game. In principle, there 
are several variants of poker, e.g., Texas Hold’em, Omaha, Draw, etc., each having its own 
rules. However, they all follow a similar logic: a number of cards is distributed among 
players, who in turn decide if to play or not, evaluating the possible combinations of their 
cards (called hand) with those on the table. Since players cannot see the cards of their 
opponents, when they have to take an action (e.g., to bet money), poker is an imperfect 
information game, unlike others like chess where all players get all the system information 
simultaneously [2U] . It is worth to observe that the utilization of money makes the challenge 
meaningful, just because the underlying dynamics of poker are constituted by a series of 
bets. Hence, without money players would have no reasons to fold their hands. In general, 
there are two main formats for playing poker, i.e., tournament and ‘cash game’. The former 
entails players pay an entry fee that goes into the prize pool plus a fee to play, receiving 
an amount of chips. Then, top players share the prize pool. Instead, playing poker in the 
‘cash game’ format entails to use real money during the challenge. Therefore, in this last 
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case, players can play until they have money and, although there are no entry fees to pay, 
a fraction of each pot is taxed (i.e., a small ‘rake’ is applied). In the work [ 22 ], the author 
dehned a model for representing poker challenges, focusing on tournaments, in order to 
study the role of rationality. His main result was that the nature of poker does not depend 
on its rules but on the players’s behavior, then identifying rationality as a key ingredient 
to succeed. Hence, since the human behavior has such important role in poker, we perform 
further investigations on this direction, but considering the ‘cash game’ format. 

Let us now briefly recall the model described in [ 22 ] and summarize the main achieve¬ 
ments. This model represents ‘heads-up’ challenges, i.e., challenges that involve two players 
at a time. Players can be rational or irrational. The former move (e.g., bet and fold) by using 
the Sklansky table 1231 as reference, whereas the latter play randomly. It is worth to note 
that, for the sake of simplicity, each round is composed of only one betting phase (instead, in 
real scenarios, usually there are more phases [ 23 ])- Numerical simulations showed that, under 
these conditions, rational players win a challenge against irrational players with probability 
7r)f ~ 0.8. Hence, a rational player is supposed to win about three consecutive challenges 
{W = 3) against an irrational one. As a consequence, since ‘heads-up’ tournaments have a 
tree-like structure, the hnal winner is a rational player when the number of total participants 
N, regardless of their behavior, is < 2^. After analyzing poker tournaments by different 
conditions (e.g., also allowing rationals to change behavior), the author [22] states that the 
nature of poker depends on the players’ behavior, but not on its rules. 

Here, we focus our attention on the ‘cash game’ format. It is important to observe 
that each ‘heads-up’ challenge can last from one to several rounds, in principle depending 
on the amount of money opponents have available. Moreover, even after a single round one 
player can leave the table (i.e., ending the challenge) with her/his remaining money. In order 
to study this scenario, we consider a population of agents that interact by the dynamics of 
the classical voter model m In so doing, each agent has a state that represents its behavior 
(i.e., rational or irrational) and, at each time step, two randomly chosen agents interact, i.e., 
they play a poker challenge. Notably, we map agent states as follows: cr = -|-1 for rational 
agents and a = —1 for irrational agents. Furthermore, we assume that a rational player wins 
a full challenge against an irrational one with the probability vr/’ dehned in [ 22 ]- Therefore, 


3 


the stochastic process of a poker challenge, involving players x and y, is reduced to a coin 
flip with winning probabilities 

{ TT^ = 0.8 if (x! = y and = +1) ^ ^ 

TT^ = 0.5 if (x = y) 

and TTy=l — TT^. Then, according to voter model-like dynamics, after each interaction the 
loser assumes the state (i.e., the behavior) of the winner. Since, as said before, a cash game 
challenge can last from 1 to n rounds, we have two limit cases: 

1. a) n = oo\ an interaction corresponds to a full challenge, as after an inhnite number 
of rounds one player prevails; 

2. b) n = 1: an interaction corresponds to only one round. 

In both cases, agents start a new challenge always with the same amount of money (called 
‘starting stack’), regardless of their previous results. Furthermore, in the cash game format 
the minimal amount of a bet (called ‘big blind’) does not change over time. In the pro¬ 
posed model, we set the ‘starting stack’ to 10000 and a ‘big blind’ to 100 so, considering 
the dynamics of case a, the ratio between these two parameters only affects the length of 
challenges (see also |22]), while in the case b it has no influences. It is worth to highlight 
that both a and b refer to real scenarios. In particular, the case b represents the so called 
‘rush poker’, available in several online platforms. Moreover, we highlight that for the case 
bEq[T] cannot be used, since it holds only for a full challenge. Anyway, also for the case b, 
it would be possible to derive the winning probabilities for rational agents by analyzing the 
Sklansky table [23] . 


Now, we introduce a mean held approximation |2S] of the proposed model considering, 
in particular, the case a as the winning probability of rational agents is dehned (see equa¬ 
tion [^. Since agents can change state over time, i.e., from rational to irrational and vice 
versa, the following equations describe the dynamics of the population 

= a ■ pr{t) ■ pi{t) - b ■ Pi{t) ■ Pr{t) 

^ ^ ■ Prit) - « ■ Prit) ■ Piit) 

^prit)+ pi{t) = 1 
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with pr and pi density of rational and irrational agents, respectively. Parameters a and b 
represent the winning probabilities of each species (i.e., rational and irrational), then a = 
and b = 1 — TT^. In so doing, we are studying the system by a compartmental approach, as 
in SIS-like models adopted in computational epidemics izg. As result we achieve a RIR 
model (i.e.. Rational —)■ Irrational —)■ Rational) whose solutions, computed by integrating 
between 0 and (as we are considering an asynchronous dynamics) are 



(3) 


with r = {a — b) which justihes the minus in the exponent of the second equation in the 


system]^ Panels a,b,c of hgurej^show pr{t) and pi{t) over time, according to Equations 


and the value of the system magnetization M dehned as follows [23] 

I I 

M = (4) 

Now, we focus our attention on the behavior of the system for t ^ oo: for r > 0, we obtain 
Pi ^ 0 and Pr = 1, whereas for r < 0, the opposite happens, i.e., p* —)• 1 and pr = 0. In the 
case r = 0, as f —>■ oo, the hnal equilibrium state corresponds to the co-existence of rational 
and irrational agents, whose hnal values are Pr(0) and Pi(0), respectively. It is worth to note 
that, this third equilibrium can be obtained also by solving the system of equations 0 by 
setting = 0 and = 0 as, in doing so, we hnd a = b. Remarkably, the system 

behavior can be described by the Curie-Weiss model [25] (hereinafter CW). Notably, the 
agent population for r = 0 behaves as a spin system at T > Tc (i.e., whose temperature is 
greater than the Curie or ‘critical’ temperature), where the system equilibrium corresponds 
to a disordered phase. Instead, in the cases r > 0 and r < 0 the population behaves as 
a spin system at T < having two possible equilibria, both corresponding to an ordered 
phase. Therefore, for r equal to zero, the agent population is in a paramagnetic phase while, 
for values of r greater or lower than zero, it is in a ferromagnetic phase. A possible solution, 
to analytically describe the agent population by using the CW model, can be devised by 
mapping r to an external magnetic held h. In doing so, we can study the system equilibria 
by analyzing the free energy F. Notably, according to the Landau mean held theory [27] . 
the free energy F, as a function of the order parameter m, can be dehned as 


F{m) = —hm -|- a{T)m^ -|- 


(5) 
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FIG. 1. From a to c: evolution of the system over time, according to the proposed RIR model. 
The red dotted line represents the amount of irrational agents, the black dotted line represents 
the amount of rational agents, and the continuous blue line indicates the system magnetization 
M . a Results achieved by Pr{Q) = 0.1. b Results achieved by Pr{Q) = 0.5. c Results achieved by 
/Or(0) = 0.9. From d to f: Free energy F as a function of the order parameter m, representing 
the behavior of the agent population on varying r (i.e., the winning probabilities of rational and 
irrational agents), d Free energy, for T > Tc, related to the case r = 0, and with Pr{G) = 0.5. e 
Free energy, for T < Tc, related to the case r < 0. f Free energy, for T < Tc, related to the case 
r > 0. The order parameter m has a domain defined in [—1, +1], thus functions representing F{m) 
are indicated by continuous lines inside the domain of m, and by dotted lines outside the domain. 
The dotted lines have been added in order to improve the quality of the pictorial representation of 
the minima of F(m). 


It is strongly important to note that, in a classical spin system, the magnetization m spans 
from —1 to +1, and it is computed as follows 


m = 



( 6 ) 


with N number of spins, whose value can be a = ±1. Therefore, the dehnition of the 
magnetization M in equation (usually adopted in the context of opinion dynamics) and 
that of m, dehned in equation coincide for less than the absolute value adopted in the 
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former, i.e., M = \m\. Now, the extrema of Eq. can be found by computing solutions of 
^ = 0. In particular, for h = 0, the two solutions are m = (0, i-y/—Here, a = 
with c small positive constant, and u positive parameter, that we set to m = |. It is worth 
to observe that a(T) becomes zero as T —)■ Tc, i.e., as the system temperature approaches 
the ‘critical’ temperature. For the paramagnetic phase (i.e., T > Tc) we can set a = while 
for the ferromagnetic phase we can set a = — |. Then, Equation [] becomes 

9 4 

F{m) = —hm ± — + — (7) 

where the sign of the second term depends on the system temperature, i.e., positive for 
T > Tc and negative for T < Tc- We recall that, in the paramagnetic phase, there is a 
unique minimum of free energy for m = 0 —see panel d of hgure On the other hand, 
in the ferromagnetic phase there are two minima of free energy, which correspond to ±1. 
Remarkably, for values of h greater or lower than zero, one of the two minima becomes 
an absolute minimum of free energy: for h > 0 the minimum corresponding to m = +1 
becomes deeper than that corresponding to m = — 1, while the opposite happens for h < 0 
(i.e., F{—1) < F{+1)) —see panel e,f of hgure Then, we can study by an analytical 
approach the outcomes of the proposed model in the case a, on varying the value of the win¬ 
ning probability of rational agents tt^. We observe that the same approach cannot be used 
to analyze the case b, of the proposed model, as the winning probabilities are not dehned a 
priori as for the case a. Therefore, the expected behavior of the case b is more complex, and 
we hypothesize that its dynamics can show the presence of bifurcations. Notably, since the 
Sklansky table [23] suggests to play usually with a small set of hands, many rounds will be 
won by irrational agents due to several ‘fold’ actions performed by rational ones. As result, 
it is possible that even for a high initial density of rational agents, sometimes few irrational 
agents prevail then, for the same Pr(0), the hnal state of the population can be both -|-1 
(rational) and —1 (irrational), i.e., a bifurcation emerges. 

The proposed model is now studied by numerical simulations. In particular, we con¬ 
sider populations of different size, from N = 100 to 77 = 1000 agents, and we perform for 
each case 100 simulation runs. Since we aim to compare performances of rational versus irra¬ 
tional agents, each simulation lasts until all agents converge to the same behavior (i.e., state). 
As shown before for the RIR model, it is possible to analyze the evolution of the system. 
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FIG. 2. Evolution of the magnetization on varying the initial density of rational agents /5r-(0). a 
Results achieved by implementing the case a: agents play full challenges, b Results achieved by 
implementing the case b: agents play single rounds. Results have been averaged over 100 different 
simulation runs. 


for different initial densities of rational agents, by studying the magnetization M. We recall 
that the value of M, according to Equation]^ ranges between 0 and 1 (i.e., 0 < M < 1). 
When M ~ 0, the system is in a disordered phase as there is the same amount of agents in 
the two states, whereas as M —)■ 1 the system reaches an ordered phase, characterized by 
the presence of a prevailing state (a = +1 or a = —1). Figure shows the magnetization 
over time, achieved in the two considered cases (i.e.,a and b). Notably, both varying the 
density Pr'(O) and considering the two cases, the agent population always converges to the 
same state, in full accordance with the analytical predictions shown in Figure (for the 
case a). Before to proceed with further analyses, it is worth to spend few words in order to 
explain why the magnetization M in the panel a of hgure does not reach zero as in the 
related analytical solution (see panel a of figure [^, i.e., for Pr(0) = 0.1. Notably, recalling 
that values of M achieved in numerical simulations have been averaged over different runs, 
at each single attempt the time step t corresponding to M = 0 may vary, as we are dealing 
with a stochastic process, hence by averaging all results the average minimum value is not 
zero. At this point, it is worth to investigate the hnal population state (S), in order to know 
whether, after all challenges, agents play rationally (i.e., S = +1) or not (i.e., S = —1). 
Thus, we analyze the amount of rational agents over time *S'(t), for different initial densities 
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FIG. 3. Summation of states over time, a Results achieved by implementing the case a: agents 
play full challenges, b. Results achieved by implementing the case b: agents play single rounds. 
Results have been averaged over 100 different simulation runs. 


Pr(0) —see figure Remarkably, since values of S{t) are averaged over different simulation 
runs, and by knowing that at each attempt the population reaches an ordered phase, we 
may derive the probability that rational agents prevail on irrational ones on varying 
Pr(0). Notably, these winning probabilities have been computed for different values of Pr(0) 
from 0 to 1, focusing on small values close to 0 (e.g., 0.0033, 0.01, 0.05) for the case a and 
on high values close to 1 (e.g., 0.97, 0.98, 0.99) for the case b. The main reason to explore 
in particular low Pr(0) for the hrst case and high Pr(0) for the second case lies in the fact 
that, observing £gure[^ we found that rational agents easily prevail playing full challenges 
(i.e., a) against irrational agents that, in turn, prevail many times playing single rounds 
(i.e., b). We want to highlight that results shown in panel a of hgure are completely in 


9 























1.0 



FIG. 4. Probability that rational agents prevail (P^) on varying pr{0). In the legend, A refers to 
the case a and B to the case b. The two black lines (i.e., the dotted and the continuous one) refer 
to the computed fitting functions. The green continuous line separates the upper side of the plane, 
i.e., the ‘skill game’ area, from the lower side, i.e., the gambling area. 

accordance with the analytical solntion, as simnlations of case a have been always performed 
with T > 0 as 7r)f = 0.8 (see panelf of £gnre[^; moreover, as we hypothesized before, a more 
complex behavior emerges in the case b. In particnlar, when agents play single ronnds, the 
valne of M only increases np to 1, bnt considering the snmmation S{t), we observe that 
few irrational agents can sometimes turn into irrational the whole population even for a 
high initial density of rational agents. Eventually, hgure further highlights the detected 
differences between the two considered scenarios. Notably, we computed fitness functions 
for both cases, identifying a simple step function for a, and the function 

Cr(Pr(0)) = ^(0)’'= (8) 

for b. It is worth to recall that the function defined in equation allows to fit results of 
simulations (in the case b), and it has not been dehned by the analytical approach. On 
one hand, it is interesting to observe that in full challenges even the presence of only one 
rational agent can entails the transition to an ordered ‘rational’ phase. On the other hand, 
when playing single rounds, rational agents prevail with a probability greater than 50% only 
if Pr(0) > 0.7 hence, in our opinion, poker in this last case can be considered as gambling. 

All these results conhrm that classifying the nature of poker is a tricky task, as a lot 
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of conditions must be considered in real scenarios. In particular, according to the proposed 
model, although in tournaments it seems rationality be a key ingredient to succeed [2H], 
in the cash game format it may be sometimes appropriate to associate poker to gambling. 
Moreover, considering all risks of poker in the cash game format (see [2H]), we think both 
players and scientists working on poker be aware of our results. A further important point to 
discuss, before to conclude, is related to the validity of out model in real scenarios. Notably, 
although it would be extremely interesting to compare outcomes of the proposed model with 
real data, this is not possible as no similar datasets exist. Anyway it may be possible to 
evaluate if a player is adopting mainly a random strategy or a rational one as, according 
to the rules [23], often players have to show their hands after the round to discover who 
is the winner. Finally, we deem the importance of our results lies on related implications. 
Notably, we found that not only the player’s behavior but also the format of poker must 
be considered when classifying the nature of this game, showing that there are well dehned 
limits poker can be considered as a ‘skill game’. 
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